Revising `check_model()` by strengejacke · Pull Request #698 · easystats/performance

strengejacke · 2024-03-18T10:55:01Z

Fixes #697

TODO or to check

wonky plot from check_model() on a glmmTMB example #654
Unnecessary checks in check_model for NB models #500
check_overdispersion underestimates dispersion in mixed models #464
Improving check_model() for GLMs #376
check_autocorrelation of residuals #274
detrend in Q-Q plots (DHARMa implementation for new check_residuals() function #643 (comment))
plot() for check_overdispersion()

Fixes #697

strengejacke · 2024-03-18T11:00:55Z

@bwiernik and @mccarthy-m-g - I think the implementation works quite well now for the first methods. The plot() method works fine for the Q-Q plots (easystats/see#329). Things we should consider:

We would have to update the plot for overdispersion/zero-inflation checks (see DHARMa implementation for new check_residuals() function #643 (comment)). This is still based on the classical residuals, not the simulated ones. I have played around with revising the current code, copying the function .diag_overdispersion into a new .new_diag_overdispersion (see

performance/R/check_model_diagnostics.R

Line 296 in ad60db1

.new_diag_overdispersion <- function(model, ...) {

). This did not really work - do you have any ideas how we can have new plots for overdispersion/zero-inflation? I found these plots quite informative and would like to keep them, beside the Q-Q plot. See also discussion here.
check_zeroinflation() and check_overdispersion() now rely on simulate_residuals() for zero-inflated or negative binomial models etc., so only the really "simple" models that returned identical results to DHARMa-tests still use the old code. Only Poisson mixed models also use the "old" code, which still could be inaccurate (see check_overdispersion underestimates dispersion in mixed models #464) - do we want to use simulate_residuals() in general for mixed models when calling check_zeroinflation() and check_overdispersion()?
How do we want to deal with y axis limits when detrend = TRUE in Q-Q plots? (see DHARMa implementation for new check_residuals() function #643 (comment))
A method for check_autocorrelation() is not yet implemented. Since the DHARMa tests require additional information, I'm not sure about how to best implement such methods.

codecov · 2024-03-18T11:17:38Z

Codecov Report

Attention: Patch coverage is 12.50000% with 7 lines in your changes missing coverage. Please review.

Project coverage is 57.73%. Comparing base (6e1eb60) to head (2be53b8).
Report is 3 commits behind head on main.

Files	Patch %	Lines
R/check_model_diagnostics.R	12.50%	7 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #698      +/-   ##
==========================================
+ Coverage   57.69%   57.73%   +0.03%     
==========================================
  Files          87       87              
  Lines        6444     6464      +20     
==========================================
+ Hits         3718     3732      +14     
- Misses       2726     2732       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

strengejacke · 2026-05-25T20:13:41Z

Results for overdispersion plots from implementation in #913 and CRAN version

library(lme4)
#> Loading required package: Matrix
library(glmmTMB)
library(dplyr) ## for mutate_at, %>%
#> 
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#> 
#>     filter, lag
#> The following objects are masked from 'package:base':
#> 
#>     intersect, setdiff, setequal, union
library(performance)

#Build example data
x <- c("A", "B", "C", "D")
time <- rep(x, each=20, times=3) #time factor
y <- c("exposed", "ref1", "ref2")
lake <- rep (y, each=80)  #lake factor
set.seed(123)
min <- runif(n=240, min=4.5, max=5.5) #mins used in model offset
set.seed(123)
count <- rnbinom(n=240,mu=10,size=100) #randomly generated negative binomial data

#make data frame
dat <- as.data.frame(cbind(time, lake, min, count)) 
dat <- dat |> 
   mutate_at(c('min', 'count'), as.numeric)

#remove one combination of factors to make example rank deficient (all observations from time A and lake ref1)
dat2 <- filter(dat, time!="A" | lake !="ref1")

model <-glmmTMB(count~time*lake, family=nbinom1,
                      control = glmmTMBControl(rank_check = "adjust"),
                      offset=log(min), data=dat2)
#> dropping columns from rank-deficient conditional model: timeD:lakeref1
check_overdispersion(model) |> plot()

PR 919

CRAN version

set.seed(3)
mu <- rpois(500, lambda = 3)
x <- rnorm(500, mu, mu * 3) |> ceiling() |> pmax(0)

quine.nb1 <- MASS::glm.nb(x ~ mu)
check_overdispersion(quine.nb1) |> plot()
#> `geom_smooth()` using method = 'loess' and formula = 'y ~ x'
#> `geom_smooth()` using method = 'loess' and formula = 'y ~ x'

PR 913

CRAN version

set.seed(101)
d <- data.frame(x = runif(1000), f = factor(sample(1:200, size = 1000, replace = TRUE))) # modified for more random effects
suppressMessages(
  d$y <- simulate(
    ~ x + (1 | f),
    family = poisson,
    newdata = d,
    newparams = list(theta = 1, beta = c(0, 2))
  )[[1]]
)
m1 <- glmer(y ~ x + (1 | f), data = d, family = poisson)
check_overdispersion(m1) |> plot()

PR 913

CRAN version

docvisit <- datawizard::data_read("~/../Downloads/docvisit.txt")

mp <- glmmTMB(
  doctorco ~ sex + illness + income + hscore, 
  data = docvisit,
  family = poisson()
)
check_overdispersion(mp) |> plot()

PR 913

CRAN version

mnb <- glmmTMB(
  doctorco ~ sex + illness + income + hscore,
  data = docvisit,
  family = nbinom2()
)
check_overdispersion(mnb) |> plot()

PR 913

CRAN version

mzip <- glmmTMB(
  doctorco ~ sex + illness + income + hscore,
  ziformula = ~age,
  data = docvisit,
  family = poisson()
)
check_overdispersion(mzip) |> plot()

PR 913

CRAN version

mzinb <- glmmTMB(
  doctorco ~ sex + illness + income + hscore,
  ziformula = ~age,
  data = docvisit,
  family = nbinom2()
)
check_overdispersion(mzinb) |> plot()

PR 913

CRAN version

mzinbd <- glmmTMB(
  doctorco ~ sex + illness + income + hscore + age,
  ziformula = ~ sex + illness + income + hscore + age,
  dispformula = ~ sex + illness + income + hscore + age,
  data = docvisit,
  family = nbinom2()
)
check_overdispersion(mzinbd) |> plot()

PR 913

CRAN

^{Created on 2026-05-25 with reprex v2.1.1}

strengejacke · 2026-05-25T20:21:40Z

Most plots are very similar, but I think for some plots, #913 is an improvement. I don't see any disadvantages for PR #913

Revising check_model()

cb26681

Fixes #697

strengejacke assigned mccarthy-m-g, bwiernik and strengejacke Mar 18, 2024

strengejacke mentioned this pull request Mar 18, 2024

Revising check_model() #697

Open

docs

8137ac3

strengejacke added 2 commits March 19, 2024 13:32

Merge branch 'main' into strengejacke/issue697

3b72ffa

Merge branch 'main' into strengejacke/issue697

dcd6139

bbolker mentioned this pull request Mar 24, 2024

check_model failing on logistic regression #701

Closed

strengejacke added 19 commits March 25, 2024 09:21

version bump

0936c5e

Merge branch 'main' into strengejacke/issue697

2e8636f

Merge branch 'main' into strengejacke/issue697

cc913ac

version

d10c7c6

debug mode

6518a4d

Merge branch 'main' into strengejacke/issue697

0e29423

Merge branch 'main' into strengejacke/issue697

1ac50ea

Merge branch 'main' into strengejacke/issue697

7eb41af

Merge branch 'main' into strengejacke/issue697

535c419

Merge branch 'main' into strengejacke/issue697

c1eaa76

Merge branch 'main' into strengejacke/issue697

5fa72a2

Merge branch 'main' into strengejacke/issue697

dda4683

Merge branch 'main' into strengejacke/issue697

17b81ee

Merge branch 'main' into strengejacke/issue697

29e27bb

Merge branch 'main' into strengejacke/issue697

e1082a9

Merge branch 'main' into strengejacke/issue697

2c61214

Merge branch 'main' into strengejacke/issue697

2be53b8

Merge branch 'main' into strengejacke/issue697

20087d6

Merge branch 'main' into strengejacke/issue697

7cd93a8

strengejacke added 3 commits October 22, 2024 17:55

Merge branch 'main' into strengejacke/issue697

a8aa04d

Merge branch 'main' into strengejacke/issue697

af17c05

Merge branch 'main' into strengejacke/issue697

65efe0a

Copilot AI mentioned this pull request Oct 7, 2025

Add autocorrelation testing for simulated residuals and use simulated residuals for Poisson mixed models #860

Closed

Merge branch 'main' into strengejacke/issue697

f4c8c99

This comment was marked as outdated.

Sign in to view

Merge branch 'main' into strengejacke/issue697

da44521

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Revising `check_model()`#698

Revising `check_model()`#698
strengejacke wants to merge 28 commits into
mainfrom
strengejacke/issue697

strengejacke commented Mar 18, 2024 •

edited

Loading

Uh oh!

strengejacke commented Mar 18, 2024

Uh oh!

codecov Bot commented Mar 18, 2024 •

edited

Loading

Uh oh!

strengejacke commented May 25, 2026 •

edited

Loading

Uh oh!

This comment was marked as outdated.

strengejacke commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

strengejacke commented Mar 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

strengejacke commented Mar 18, 2024

Uh oh!

codecov Bot commented Mar 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

strengejacke commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR 919

CRAN version

PR 913

CRAN version

PR 913

CRAN version

PR 913

CRAN version

PR 913

CRAN version

PR 913

CRAN version

PR 913

CRAN version

PR 913

CRAN

Uh oh!

This comment was marked as outdated.

strengejacke commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

strengejacke commented Mar 18, 2024 •

edited

Loading

codecov Bot commented Mar 18, 2024 •

edited

Loading

strengejacke commented May 25, 2026 •

edited

Loading